A High-Performance Implementation of Atomistic Spin Dynamics Simulations on x86 CPUs
نویسندگان
چکیده
Atomistic spin dynamics simulations provide valuable information about the energy spectrum of magnetic materials in different phases, allowing one to identify instabilities and nature their excitations. However, time cost evaluating dynamical correlation function $S(\mathbf{q}, t)$ increases quadratically as number spins $N$, leading significant computational effort, making simulation large systems very challenging. In this work, we propose use a highly optimized general matrix multiply (GEMM) subroutine calculate spin-spin that can achieve near-optimal hardware utilization. Furthermore, fuse element-wise operations calculation into in-house GEMM kernel, which results further performance improvements 44\% - 71\% on several relatively lattice sizes when compared implementation uses OpenBLAS, is state-of-the-art open source library for Basic Linear Algebra Subroutine (BLAS).
منابع مشابه
SSE Implementation of Multivariate PKCs on Modern x86 CPUs
Multivariate Public Key Cryptosystems (MPKCs) are often touted as future-proofing against Quantum Computers. It also has been known for efficiency compared to “traditional” alternatives. However, this advantage seems to erode with the increase of arithmetic resources in modern CPUs and improved algorithms, especially with respect to Elliptic Curve Cryptography (ECC). In this paper, we show that...
متن کاملAtomistic spin model simulations of magnetic nanomaterials.
Atomistic modelling of magnetic materials provides unprecedented detail about the underlying physical processes that govern their macroscopic properties, and allows the simulation of complex effects such as surface anisotropy, ultrafast laser-induced spin dynamics, exchange bias, and microstructural effects. Here we present the key methods used in atomistic spin models which are then applied to...
متن کاملVisualization in atomistic and spin simulations
Although I don't have a magic solution to the work involved in raising children (although I rather like my results and theirs), I can describe some good solutions for atomistic and spin visualization. Certain basic demands from visualiza-tion code are common to all atomistic or spin systems; some are also relevant to more general visualization needs. Many commercial and homegrown visualiza-tion...
متن کاملMARSSx86: A Full System Simulator for x86 CPUs
We present MARSS, an open source, fast, full system simulation tool built on QEMU to support cycle-accurate simulation of superscalar homogeneous and heterogeneous multicore x86 processors. MARSS includes detailed models of coherent caches, interconnections, chipsets, memory and IO devices. MARSS simulates the execution of all software components in the system, including unmodified binaries of ...
متن کاملHigh-performance Physics Simulations Using Multi-core CPUs and GPGPUs in a Volunteer Computing Context
This paper presents two conceptually simple methods for parallelizing a Parallel Tempering Monte Carlo simulation in a distributed volunteer computing context, where computers belonging to the general public are used. The first method uses conventional multi-threading. The second method uses CUDA, a graphics card computing system. Parallel Tempering is described, and challenges such as parallel...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computer Physics Communications
سال: 2023
ISSN: ['1879-2944', '0010-4655']
DOI: https://doi.org/10.1016/j.cpc.2023.108851